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NONLINEAR  OPTIMAL  TRAJECTORIES  USING  SUCCESSIVE  LINEARIZATION* 

Calvin  Hecht** 

The  Aerospace  Corporation 
El  Segundo,  California 


Abstract 


A method  of  designing  trajectories  for  maneu- 
vering reentry  vehicles  is  developed  in  this  paper. 
Optimal  control  theory  using  linear  perturbation 
guidance  and  a quadratic  penalty  function  is 
employed  to  design  trajectories  for  these  vehicles. 
The  vehicle  equations  of  motion  are  represented 
by  a five-dimension  state  vector.  The  problem  is 
formulated  by  linearizing  about  a reference  tra- 
jectory. A linear  servomechanism  problem, 
rather  than  a regulator  problem,  is  solved,  lead- 
ing to  a Riccati  matrix  differential  equation  and  an 
auxiliary  vector  differential  equation  which  are 
solved  backwards  in  time.  Successive  lineariza- 
tion consists  of  iteratively  generating  new  trajec- 
tories by  linearizing  about  a reference  trajectory 
created  from  a prior  iteration.  The  cost  function 
is  adjusted  between  iterations  to  shape  the  trajec- 
tory, Qualitative  guidelines  for  selecting  these 
cost  functions  are  given.  Teat  cases  are  shown, 
with  emphasis  on  designing  a trajectory  to  inter- 
cept a target  point. 


Introduction 


The  objective  of  this  paper  is  to  investigate  the 
application  of  optimal  control  theory  to  design 
trajectories  for  maneuvering  reentry  vehicles. 

The  trajectory  must  satisfy  specified  conditions, 
and  possibly  be  optimal  in  some  sense.  A general 
formulation  of  the  problem  leads  to  the  Euler - 
Lagrange  equations,  which  are  two-point  boundary 
value  problems,  or  the  Hamilton-Jacoby-Bellman 
partial  differential  equation. 

Much  of  the  recently  published  research  related 
to  maneuvering  reentry  vehicles  is  concerned  with 
determining  an  optimal  control  to  enable  the 
vehicle  to  follow  a prescribed  trajectory.  . The 
methods  used  for  solving  these  trajectory-following 
problems  are  variations  of  the  perturbation  guid- 
ance schemes  discussed  in  Bryson  and  Ho  and  in 
various  other  texts.  The  basic  perturbation 
guidance  scheme  is  concerned  with  time-varying 
linear  systems  with  quadratic  performance  criteria 
and  small  deviations  from  a nominal  solution. 

The  specified  conditions  for  an  optimal  trajec- 
tory are  given  in  terms  of  terminal  conditions. 

For  accuracy  trajectories,  the  ‘rnpact  point  and 
final  flight  path  angle  are  specified.  Efficient 
flight  path  shaping  leads  to  steeper  terminal  flight 
path  angles,  which  reduce  navigation  errors. 

Range  extension  and  evasion  trajectories  can  also 
be  specified  by  the  terminal  conditions.  In  each 
case,  the  initial  conditions  arc  fixed.  The  one 
variable  that  needs  attention  throughout  the  entire 
flight  is  the  control  vector,  since  its  magnitude  is 
required  to  not  exceed  specified  values  and  it  plays 
an  important  role  in  determining  total  flight  time 
and  final  velocity. 


Specifying  only  terminal  conditions  allowed  the 
use  of  an  extended  linear  perturbation  approach 
with  quadratic  performance  criteria.  The  desired 
trajectory  is  significantly  different  from  an  initially 
available  nominal  or  reference  trajectory.  There- 
fore, an  iterative  approach,  or  a sequence  of  tra- 
jectories, can  be  developed,  each  trajectory  being 
a reference  nominal  for  the  subsequent  iteration. 

The  iterative  method  of  successively  generating 
and  linearizing  about  nonlinear  trajectories  as 
described  in  this  paper  permits  penalty  factors  of 
the  quadratic  cost  function  to  be  adjusted  between 
iterations  for  the  purpose  of  shaping  the  final  tra- 
jectory to  meet  the  required  goals  and  to  ensure 
convergence  if  necessary.  The  method  of  succes- 
sive linearization  is  similar  to  the  method  of 
quasilinearization,  or  generalized  Newton -Raphson 
approach.  There  is  an  advantage  for  the  method 
described  in  this  paper,  however:  direct  access  of 
intermediate  computations  is  available,  enabling 
the  iterative  adjustments  of  the  problem  parameters. 

The  second  section  of  this  paper  briefly  sum- 
marizes the  applicable  linear  perturbation  theory, 
including  the  servomechanism  technique.  The 
nonlinear  and  linearized  equations  of  motion  are 
given,  and  the  section  concludes  with  a more 
detailed  description  of  the  successive  linearization 
method.  The  third  section  contains  a discussion  of 
the  evaluation  of  the  weighting  matrices.  The  last 
section  contains  descriptions  of  test  cases  to 
further  illustrate  the  method  and  to  demonstrate  its 
usefulness. 


Linear  Perturbation  Theory 
Linear  Servomechanism  Technique 


The  nonlinear  differential  equations  describing 
the  vehicle  motion  are  given  by 

X = f{x,  u)  (1) 

with  X the  state  vector,  u the  control  vector,  and  f 
a vector  valued  function  of  the  state  and  control. 
Linearizing  about  a nominal  state  and  control  func- 
tion, XQ(t)  and  Uo(t),  and  defining 

6x  = X - Xo  j 

(-1) 

6u  = u - u 1 
o I 

gives 

5x  = F 6x  + G 6u  (3) 


F and  G are  time-varying  Jacobian  matrices  whose 
elements  1^^  and  g^j  are  given  by 


3^ 
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Sf.(3C,  U)\ 

‘ij  = -0^1 

3f^(x.  u)J 
«ij  = 3u.  j 

where  iu  xii  and  uj  are  the  componente  of  their 
respective  Vectors,  and  where  fjj  2und  are 
evaluated  along  a nominal  or  reference  trajectory. 


1.  Generate  a nominal  trajectory  Xo(t)  with  a 
nominal  control  UQ{t),  and  compute  and  store  the 
F(t)  and  G(t)  matrices. 

2.  Initialize  P and  g at  the  terminal  time, 
according  to  Eqs.  (7)  and  (8),  smd  integrate  back- 
wards from  tf  to  to.  Compute  and  store  the  gain 
matrix 

K = R'^  G'P  (9) 


The  objective  of  the  optimization  process  is  to 
generate  a trajectory  which  has  specified  desirable 
properties.  For  the  reentry  vehicle  trajectory 
problems  discussed  in  this  paper,  the  teririinal 
conditions  are  specified.  Designating  this  desired 
terminal  state  iiUf)>  we  formulate  the  problem  by 
minimizing  the  quadratic  cost  function 

J = |•|||6x(tp  - -x^(tj))j|2 

+ j[^IM‘)||Q+|l6u(t)|||dt|  <5) 

The  first  term  of  Eq.  (5)  represents  the  penalty 
for  a deviation  from  the  desired  terminal  state  at 
the  final  time.  The  elements  of  the  S matrix, 
therefore,  would  be  much  larger  than  the  corres- 
ponding elements  of  the  Q matrix.  The  first  term 
inside  of  the  integral  sign  represents  a penalty  for 
deviations  of  the  perturbed  trajectory  from  the 
nominal  trajectory.  Since  the  desired  trajectory 
is  not  necessarily  close  to  the  nominal,  Q would 
normally  be  small.  However,  this  term  needs 
some  weighting  to  keep  the  perturbed  trajectory 
close  to  the  nominal  to  allow  the  linear  perturba- 
tion approximations  to  be  valid.  The  second  term 
inside  of  the  integral  sign  represents  the  penalty 
for  control  variations  about  the  nominal,  and  needs 
to  be  weighted  to  ensure  that  the  control  does  not 
exceed  specified  bounds. 

A solution  to  the  minimization  problem  stated 
in  Eq.  (5)  is  given  in  Sage,  ^ Page  96,  or  Bryson 
and  Ho,  4 Chapter  5.  The  solution  is 

6u  = - R'^  G'(P  6x  - e)  (6) 


and  control  vector 

k = R'^  g'I  (10) 

3,  Initialize  x(to)  and  integrate  Eq.  (1)  forward 
in  time  with 

u=-K6x+k  + u (11) 

o 

with  6x  = X - Xq  to  obtain  x(t),  a nonlinear  solution, 
and  an  optimal  linearized  control  u(t). 

System  Description,  Dinearized 
Equations  of  Motion 

The  coordinate  system  used  to  describe  the 
equations  of  motion  was  centered  on  the  surface  of 
a flat,  nonrotating  earth,  with  x pointing  down- 
range,  y crossrange,  and  z positive  upward  along 
the  local  vertical  and  passing  through  the  vehicle 
at  time  equals  zero.  The  point  mass  representa- 
tion of  the  vehicle  is  shown  in  Figure  1.  The 
state  equations  were 


= s cos  Y cos  if* 

(12) 

s s cos  Y sin  4^ 

(13) 

= - s sin  Y 

(14) 

= qA  “1 

(15) 

^m  s cos  V 

= -a^ 

m 8 “2 

(16) 

with  the  matrix  P(t)  and  vector  4(t)  given  by 
p = - PF  - F'  P + PGR'^  G'  P - Q 
P(tf)  = S 

i (-  F'  + PGR"^  o')  i 

4(1^)  = 

where  prime  (')  denotes  transpose. 

Equation  (7)  la  the  same  Riccati  equation 
obtained  for  the  linear  regulator  problem,  and 
Eq.  (8)  la  an  auxiliary  equation  needed  for  the 
servomechanism  problem. 


? ALTHUDt 
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The  algorithm  for  solving  the  linearized  servo- 
mechanism problem  is  as  follows: 


Figure  1.  Coordinate  definition. 
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(17) 


with 


X,  y,z 


4^  I Y 


s 

Uj.  U2 

q 


q — C „(u , , 
^ m D'  1 ' 


- the  position  coordinates  of  the 
vehicle 

- two  angles  defining  the  velocity 
vector 

- vehicle  speed 

= control  components 

2 

= dynamic  pressure  = (l/2)ps 


^45  = Cj  8 Uj  exp  (“z/hg) 

cos  Y 

Si  ^1  ^ ‘^i  °^p(-^/^)fr 

s 

g41  = Cj  s exp  (-z/h^)  ^3^ 

8^2  — <^1  8 exp(-z/h^) 

cont 

with  the  constant  Cj  - (1/2)  Cj^(A/m)p^. 

The  lie  sarlzed  perturbation  differential  equation 
is  give  . b/  Eq.  (3)  with 


A/m  = reference  area  to  mass  ratio 

Cl,  = slope  of  the  lift  coefficient 

Cd(uj,U2)  = nonlinear  drag  coefficient 

p = air  density  = Pq  exp  (-z/hg) 

pQ,  hg  = constants  defining  air  density 

approximation 

The  control  components  and  u^  are  the  com- 
ponents of  angle  of  attack.  The  direction  of  the 
force  associated  with  angle  of  attack  is  shown  in 
Figure  1.  This  model  is  applicable  to  either  a 
cruciform  configuration  vehicle  or  a bank-to-turn 
vehicle  with  negligibly  small  autopilot  time 
constants. 

There  is  no  direct  control  of  the  speed.  The 
state  modal  X was  therefore  represented  by  the 
five -state  vector 


"O 

0 

0 

^4 

'15' 

0 

0 

0 

^24 

^25 

F = 

0 

0 

0 

0 

^35 

0 

0 

*^43 

0 

^45 

_0 

c 

^53 

0 

0 - 

“ 0 

0 

0 

0 

G 

= 

0 

0 

g41 

0 

0 

OQ 

1 

(19) 


(2  0) 


The  gain  matrix  K,  Eq.  (9),  and  control  vec- 
tor k,  Eq,  (10),  are,  respectively,  a (5  x 2) 
matrix  and  a two -dimension  vector. 
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Successive  Linearization  Technique 


X n z 


‘I' 


The  speed  equation  (17)  was  used  for  the  solution 
of  the  nonlinear  state  aquations  (12)  through  (16), 
but  was  not  considered  an  element  of  the  state 
vector  for  the  purpose  of  solving  the  linearized 
equations. 

The  F and  G matrices,  given  in  Eq«.  (3)  and 
(4)  for  the  linearized  model,  wore  dimensioned, 
respectively,  (5  x 5)  and  (5  x 2),  with  all  of  the 
elements  equal  to  zero  except  those  listed. 

fj,j  = - s cos  Y sin  Y 

fj  j = - s cos  ip  sin  Y 


The  successive  linearization  method  iteratively 
applies  the  algorithm  described  in  the  preceding 
section,  with  the  nominal  trajectory  and  control 
for  each  iteration  being  the  solution  of  the  prior 
iteration. 

The  aerodynamically  controlled  reentry  vehicle 
problems  discussed  in  this  paper  were  started 
with  the  nominal  control  UQ(t)  = 0.  After  an  initial 
reference  trajectory  is  generated  with  this  control, 
a perturbed  trajectory  is  generated  using  an 
optimal  controller  derived  from  the  linearized 
equations  and  the  initial  cost  function  penalty 
matrices.  When  this  first  pass  is  completed,  we 
have  a perturbed  reference  trajectory  Xf,(t)  and  a 
iiununal  u^^ft)  / 0,  A control,  u(t),  associated  witli 
a trajectory  X(t)  that  has  the  required  specified 
properties  can  be  obtained  by  a judicious  selection 
of  the  weighting  matrices  Q,  R,  and  S. 


^24  ' ® 7 

^25  ” ' * ^ 


The  process  is  illustrated  in  Figure  2.  The 
technique  for  the  selection  of  the  weighting 
matrices  is  described  In  the  next  section. 


fjg  - - 8 cos  Y 

£43  = - Cj  suj  exp(.z/l,^)-5P^ 

s 


The  gain  .natrix  K and  the  auxiliary  control 
vector  k for  each  iteration  are  related  to  the 
selected  weighting  matrices  Q,  R,  and  S;  Eqs.  (7) 
through  (tO).  A qualitative  relationship  between  the 
( 1 g)  weighting  matrices  and  the  defined  performance 
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Note:  I argument  is  digital  Idiscietel  time, 
I argument  is  continuous  time 


Figure  2.  Successive  linearization  technique. 


rule-of-thumb  method,  suggested  by  Bryson  and 
Ho,  ^ is  to  proportion  the  weights  according  to  the 
reciprocal  of  the  squares  of  the  maximum  accept- 
able deviation  from  the  nominal. 


S" 


1 


Q 


1 


n X maximum  acceptable  value  of 
x(tj)  x'  (tj) 

n X (tf  - to)x  maximum  acceptable 
value  of  x(t)  x'  (t) 


R = r X (tf  - t„)  X maximum  acceptable 
value  of  uU)  u'  (t). 

with  n the  state  vector  dimension,  r the  control 
vector  dimension,  and  (tf  - to)  the  limits  of  inte- 
gration of  the  cost  function,  Eq,  (5). 


The  trajectories  developed  using  the  method 
described  in  this  paper  were  not  necessarily 
concerned  with  minimizing  a deviation  from  a 
reference.  The  relationship  between  the  solu- 
tion and  the  weighting  matrices  is  roughly 
governed  by  the  inverse  square  law,  and  this 
information  was  used  to  aid  in  choosing  the  value 
for  the  weighting  matrices. 


The  qualitative  effect  of  the  weighting  matrices 
on  the  shape  of  the  final  trajectory  can  easily  be 
determined  by  referring  to  the  cost  function, 

Eq.  (5);  large  Q tending  to  hold  the  perturbed 
trajectory  close  to  the  reference;  large  R tending 
to  reduce  the  amount  of  control;  and  large  S tend- 
ing to  reduce  the  error  between  the  perturbed 
final  sUte  and  the  target  point.  Other  trajectory 
properties  have  been  specified,  which  also  must 
be  controlled  with  Q,  R,  S selection:  the  total 
time  of  flight  and  the  dnal  velocity.  These  rela- 
tionships are  shown  in  Table  1,  which  gives  quali- 
tative guidelines  for  adjusting  the  weighting 
matrices  from  one  iteration  to  the  next. 


Because  the  desired  trajectory  is  generally  not 
close  to  the  reference  trajectory,  Q should  be  as 
small  as  practical.  Q must  be  at  least  positive 
s emi -definite.  The  Q matrix  functions  as  a stabi- 
lizing controller,  and  generally  needs  small  posi- 
tive numbers  on  the  diagonal  to  ensure  a stable 
solution.  The  desired  trajectory  must  come  close 
to  the  target  point,  suggesting  large  positive  num- 
bers for  the  S matrix. 


criterion  can  be  developed.  The  weighting  matrices 
can  then  be  changed  between  iterations,  based  on 
the  measured  conditions,  to  achieve  the  desired 
performance.  Specific  methods  for  shaping  the 
trajectories  of  the  reentry  vehicle  are  shown  in 
the  sections  on  "Evaluation  of  the  Weighting 
Matrices"  and  the  "Test  Cases.  " 

Evsduation  of  Weighting  Matrices  Q,  R,  S 


The  performance  of  the  vehicle,  as  character- 
ized by  the  shape  of  the  final  trajectory,  is  deter- 
mined by  the  values  of  the  Q,  R,  and  S matrices. 
No  general  analytic  method  is  available  for  com- 
puting these  matrices  for  the  time  varying  system 
described.  Sworder  and  Wells^  demonstrate  an 
analytic  approach  which  can  bo  used  when  making 
certain  slmplylng  assumptions,  and  de  VlrglUo, 
Wells,  sjod  Schiring^  suggest  a procedure  for 
determining  these  matrices.  The  commonly  used 


Experience  has  shown  the  chief  influence  on  tlie 
shape  of  the  trajectory  is  the  relative  weighting  of 
the  elements  of  the  S matrix.  Essentially,  the  por- 
tairbed  trajectory  wUl  terminate  much  closer  to  the 
target  point  tlmn  the  nonmaneuverlng  Impact  point 
on  the  first  iteration.  The  relative  weights  on  the 
terminal  position  components  and  flight  path  angle, 
although  having  second  order  effects  on  the  final 
state,  appear  to  strongly  Influence  the  trajectory 
shape.  These  component  weightings  can  be  ade- 
quately accomplished  by  manipulating  the  diagonal 
components,  with  the  off-diagonal  elements  set 
equal  to  zero. 

The  control  vector  u is  subject  to  a direct  and 
an  Indirect  bard  constraint.  The  direct  constraint 
is  Unnax  u whore 


withlu|  u (Uj  + ^ . 
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Table  I.  Qualitative  relationahlp*  for  adjusting 
weighting  matrices. 


Matri.s  Adjustment 

Q 

R 

S 

Follow 

Int  ri‘a»c 

Inc  reAse 

Decrease 

Reference 

Intercept 

Decrease 

Decrease 

Increase 

Target 

Reduce 

Decrease 

Increase 

Decrease 

Control 

Dec  reaio 

Increase  if  per- 

Increase  if  per- 

Increase  altitude 

Tiine  of 

turbfd  t rajec- 

turbed  t rajec- 

component; 

Flight 

tory  IS  longer 

tory  is  longer 

decrease  hori- 

than  nonmaneu- 

than  nonmaneu- 

zontal  compo- 

vering  trajec- 

vering  trajec- 

nents.  Compo- 

lory;  decr^*a«e 

tory;  decrease 

nent  refc  ra  to 

otherwise. 

otherwise . 

diagonal  element 
of  matrix. 

Increase 

Increase  if  per- 

Increase  if  per- 

Increase  altitude 

Terminal 

turbed  trajec- 

turbed  trajec- 

component; 

Speed 

tory  terminal 

tory  terminal 

decrease  hori- 

• peed  ia  lower 

•peed  is  lower 

zontal  compo- 

than  nonmaneu- 

than  nonmaneu- 

nents.  Compo- 

vering  trajec- 

vering  trajec- 

nent  refers  to 

tory;  decrease 

tory:  decrease 

diagonal  clement 

otherwise. 

otherwise. 

of  matrix* 

The  control  must  also  limit  the  normal  accel- 
eration to  an  acceptable  value,  The  accel- 

eration is  given  by 


a = acceleration  in  e's 

g * 

q = dynamic  pressure 

= slope  of  the  lift  coefficient 
W/Ag=  IT, ass  to  reference  area  ratio 
g = gravity 

The  indirect  constraint  is 


U/qC. 


Then, 


u < u , u 

max  “ max  u max  a 


The  total  control  (nominal  plus  perturbed)  is 
tested  at  each  point  along  the  trajectory.  If  it 
exceeds  it  is  limited  as  shown  in  Figure  3, 

and  the  R matrix  is  modified  for  the  next  iteration 
according  to 


U, 


max 


*^8  U„ 


Figure  3.  Scaling  the  control  when 
exceeding  the  constraint. 


r. . 

11  new 


" i diagonal  element  of  R matrix 

’’ii  new  " diagonal  element  of  R matrix  for 
the  next  iteration 

u = u + 6u 

s o 

As  Jhown  in  Figure  3,  Ug  is  the  total  control  from 
the  linear  formulas  when  u^  + 6u  exceeds 

Test  Cases 

The  procedures  described  in  this  paper  were 
used  to  generate  trajectories  designed  to  achieve 
various  objectives.  Two  sample  cases  are  de- 
scribed in  this  section.  The  descriptions  are  for 
the  purpose  of  illustrating  the  technique  and  demon-;, 
strating  the  types  of  results  that  can  be  obtained.  | 
Numerical  results  are  given  in  terms  of  normal- 
ized data;  linear  distances  are  referenced  to  h,  f 
the  initial  altitude,  and  angles  are  referenced  to  .j 
® y,  the  maximum  angle  of  attack.  '•>  •; 

Specified  Target  and  Terminal  Flight  Path 
Angle  — No  Acceleration  Constraint 

A basic  trajectory  is  one  that  has  a fixed  initial  ; 
state,  impacts  at  a specified  location,  approaches 


Vehicle  parameters  and  aerodynamic  data  for  the  evaluation  of  the  trajectory  generation  scheme  were 
similar  to  the  examples  used  in  an  unclassified  report  with  unlimited  distribution.^ 
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l)tt>  Ui'HVl  AtohH  M l•|>v.H> till'd  dlvitUnD  (eoi'ii'itntil 
fttghi  |>Ml)i  AnDl«)i  And  intnlniluoH  uoiitrnl  afiort.  ^ 

'ri\A  nMninuim  (ontrol  iiKort  AldH  In  Achlnvlng 
till?  AddllInnAl  nbitinllvit«  of  InnvnAaiul  tni'ininal 
vaInnKv  And  intnlnwun  (lluht  ttmti  (hd*)  Tfthln  1), 
CnndUlona  <«»’  t)\l«  caan  »nd  unmpvitnci  roault*  aro 
llHt«d  In  Tallin  i,  Klmii'tiH  4 thr'inud  14  at'o 
iji)ni)Hit«i''(l«n«i'«ttid  (iliita  of  tvajnctnvy  data, 

Tl\ii  vviilulillni)  un  pualtlun  di>vlatli>n«  aloiiK  tlu> 
Irsjnulni'y  wan  iuu'o,  WolHhtlns  on  tho  «n«l<>» 
aUinn  thn  traji'i'toi'y  oorri>Mi>ondod  rcinnhly  to  do- 
vUtlona  o(  0,  i(H;  wi>lnhtlinj  on  tho  control  (angle 
of  attack)  cot'i'i'«|ioiulod  roughly  to  0,  d5,  with  a 
hard  conatralnt  of  1,0,  TornUnal  wolghtlngs  cor- 
rouiiondod  roughly  to  noNltlon  orrorn  of  0,003, 
vnry  largo  allowahio  amlinuth  dovUtion  and  a flight 
path  angle  error  of  0,  US, 

Klguro*  4,  h,  and  6 ahow  the  trajectories.  The 
lli'Ht  perturhed  trajectory  (Uoratlon  1)  came  fairly 
clone  to  tho  (Inal  trajectory,  Tho  terminal  flight 
path  angle  wa»  algnlflcantly  different  than  the  tar- 
get value  of  il,()17.  Subsequent  Iterations  improved 
the  trajectory  as  Indicated  in  Table  i.  After  the 
third  iteration,  the  difforonces  between  the  actual 
and  targeted  terminal  conditions  were  reasonably 
small!  downrango  error  - 0,00156,  crossrange 
error  » 0,  flight  path  angle  error  - 0.  006. 

The  flight  time  of  6,  135  sec  and  speed  of 
0,  ilH  I *) / sec  at  the  last  Iteration,  compared  to  the 
nonmaneuvering  values  of  3.  998  sec  and  0.  3925/ 
sec,  demonstrates  the  trade-off  between  the  ma- 
neuvering and  nonmanouvorlng  cases. 


Table  2.  Specified  target  — no  acceleration 
constraint 


Test  Conditions 


liuiiAl  Corwlitions: 

\ = Vo  ‘ “•  ‘o  ° "o/*'  " '■  ® 

*„/h  ' 0.  5Sl/»ec,  -iia  = 0,  Y„  ■=  1.  309 
Weighting  Matrices: 

■ ‘o>-  ''u  “ ‘'zz  “ <‘r  - 

SlI  • Sz2  “S33-^’<'0'^ 

All  Other  Elements  « 0 
Control  Constreint: 


Terminal 

Conditions 

Iteration  No. 

Target 

Condition 

0 

1 

2 

3 

s/l< 

1.  T3Z07 

2. 

S0B73 

2.  49833 

2.  49843 

2.  50000 

y /h 

0 

0. 

33220 

0. 33327 

0,  33333 

0.  33333 

v/» 

1.  309 

2. 

325 

2.  603 

2,  623' 

2.  617 

Tlmu,  sec 

3.  998 

6, 

119 

6.  125 

6.  125 

... 

Sps«d/h, 

0.  39Z5 

0. 

2924 

0. 2B7B 

0.  2819 

1 /•so 



Figure  4.  Trajectory  in  the  x-z  plane. 
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Figure  9.  An  element  of  the  P matrix. 
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Figure  10,  Horizontal  component  <jf 
auxiliary  control. 
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Figure  1 1 , Vertical  component  of 
auxiliary  control. 
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Figure  14,  Gain  element  1^25’ 

Improvements  In  flight  time  and  final  speed  could 
have  been  made  at  the  expense  of  larger  terminal 
errors  in  position  or  angle. 

Figures  7 and  8 show  the  control  functions. 

The  first  iteration  appears  rather  ragged  duo  to 
the  discretization  method  (sampling  of  0.  l-sec 
Intervals),  The  control  U|  is  the  angle  of  atl.ick 
in  the  horizontal  plane,  and  U2  the  angle  of  attack 
In  the  pitch  plane.  At  no  time  was  tlu-  I.  0 con- 
straint reached. 

Figure  9 shows  a typical  clement  P)  j of  the 
P matrix  In  the  solution  of  the  Riccall  lupialion. 
The  solution  goes  from  left  to  right,  with  the  lime 
axes  latioled  with  a minus  sign,  The  1’  matviN  ii- 
significantly  different  between  the  first  llerallon 
and  subsetiuent  iterations.  Figures  10  and  11 
show  the  soUillon  to  the  auxiliary  control  vector  k, 
The  fls'jt  solution  Is  very  largo  contpared  to  the 
subsetiuent  solutions,  as  might  be  expeeletl,  siiice 


Best  Available  Co- 


this  function  is  related  to  the  error  between  an 
interim  and  the  solution  trajectory. 

Figures  12  through  14  show  three  elements  of 
the  (2x5)  gain  matrix  K,  The  three  elements 
are  K21,  K23i  and  K25,  which  are  the  vertical 
control  gains  for  x,  z,  and  Y errors,  respectively. 
An  interesting  fact  is  that  the  gain  K21  is  mostly 
of  Opposite  polarity  for  Iterations  2 and  3 as  com- 
pared to  Iteration  1. 

Specified  Target  and  Terminal  Flight  Path  Angle  — 
Angle  of  AttaA  and  Acceleration  Constraint 

A more  realistic  test  casta  is  one  that  is  limited 
both  in  maximum  angle  of  attack  and  normal  accel- 
eration. An  in-plane  target  was  selected  to  exa- 
mine the  R matrix  adjusting  algorithm. 

Conditions  for  this  case  and  computed  results 
are  listed  in  Table  3.  Figures  15  through  18  are 
computer -generated  plots  of  trajectory  data. 

The  initial  weighting  matrices  were  the  same 
as  in  the  previous  case,  except  that  out-of-plane 
parameters  were  ignored.  Figures  15  and  16 
show  the  trajectories.  The  first  perturbed  tra- 
jectory came  fairly  close  to  the  terminal  target 
position,  but  the  terminal  flight  path  angle  error 
was  large  as  seen  in  Figure  16  and  Table  3,  An 
identical  case  had  previously  been  tried  with  no 
acceleration  constraint  (not  reported  herein),  and 
the  terminal  flight  path  angle  error  was  consid- 
erably smaller  on  the  first  iteration.  Subsequent 
iterations  improve  the  trajectory  as  indicated  in 
Table  3.  After  the  fourth  iteration,  the  differences 
between  the  actual  and  targeted  terminal  conditions 
were  small:  downrange  error  = 0.  00073,  flighfpath 


Table  3.  Specified  target  — acceleration 
constraint 


Test  Conditions 


Initial  Conditions: 

= 0,  Yo  = 0,  Zg/h  = 1.0 

0.  593  / sec 
Weighting  Matrices: 


Q44  = = 20/(t,  - ^ 0.01  (tj  - 1^) 


fi-5 


All  Other  Elements  » 0 
Control  Constraint: 


u__  =1.0  u /o  * 0,  1 6 at  t = 0 

max  u max  a 


Iteration  No. 


Conditions 

0 

1 

2 

3 

4 

Condition 

x/h  ft 

1. 73207 

2. 50377 

2.  50530 

2, 50133 

2. 50073 

2. 5000 

y/h  ft 

0 

0 

0 

0 

0 

0 

V/o  rad 

1.  309 

1,  713 

1. 864 

1, 921 

1. 939 

1.  950 

TimOi  sec 

3.998 

5.  856 

5.  871 

5.  874 

5.  876 

■ 

Speed/hi 

1 /sec 

0,  3925 

0. 33307 

0.  3313 

0.3310 

0,  3310 

- 

Figure  15.  Trajectory  in  the  x-z  plane, 
constraint  problem. 


Figure  16.  Flight  path  angle, 

constraint  problem. 


Figure  17.  Vertical  control,  constraint 
problem. 


Figure  18. 


An  element  of  the  R matrix, 
Rll  = R22- 
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angle  error  : 0,011,  Referring  to  Table  3,  we 
aee  the  final  flight  path  angle  approaching  the 
targeted  value  with  very  email  changes  of  flight 
time  or  final  apoed  showing  the  effectiveness  of 
the  R weighting  matrix. 

Figure  17  shows  the  control  function.  Because 
of  0,  16  constraint,  the  magnitude  Of  U2  is 
limited  to  about  0,  16  at  the  start  and  again 
toward  the  end  of  the  flight.  As  R is  adjusted 
between  iterations,  the  boundary  riding  portion  of 
the  trajectory  is  extended,  from  about  1,0  to  1.2 
sec  at  the  start  and  from  5.4  to  4.8  sec  toward 
the  end.  We  can  clearly  see  the  control  going 
from  a proportional  controller  to  a bang-bang 
type,  which  is  known  to  be  optimal  for  certain 
types  of  problems.  Figure  18  shows  one  element 
of  the  R matrix  for  this  case.  Initially  R is  con- 
stant, about  25  (RjJ  = 0.01  (4)).  After  the  first 
iteration,  we  see  R growing  at  the  extremities  of 
the  trajectory,  where  the  proportional  controller 
is  requesting  a control  magnitude  greater  than  the 
prescribed  limit. 

Conclusions 

This  paper  demonstrates  that  reentry  trajec- 
tories can  be  designed  using  linear  optimal  con- 
trol theory  and  a method  of  successive  lineariza- 
tion. A unique  feature  of  successive  linearization 
is  the  ability  to  adjust  the  weighting  matrices 
between  iterations  to  shape  the  trajectory  to  meet 
specified  requirements.  An  algorithm  for  com- 
puting values  of  the  R weighting  matrix  gives  an 
improvement  over  the  trial-and-error  methods 
that  have  previously  been  used, 

A more  general  aspect  of  the  paper  is  the  devel- 
opment of  a method  of  applying  optimal  control 
theory  to  nonlinear  problems.  We  have  not  proved 
that  the  solutions  using  successive  linearizations 
will  converge.  When  they  do  converge,  we  have 


not  proved  that  the  solution  is  optimal  in  any 
sense.  We  have  shown,  however,  for  the  cases 
considered,  that  by  proper  choices  of  the  penalty 
function  weighting  matrices  a solution  can  be 
made  to  converge  to  a trajectory  having  desirable 
specified  terminal  properties. 
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